An Unsupervised Approach to Discovering and Disambiguating Social Media Profiles

نویسندگان

  • Carlton T. Northern
  • Michael L. Nelson
چکیده

Social media in the last decade has become a popular communication mechanism on the web. Sites like Facebook, Twitter and YouTube are seeing enormous growth. It is important to understand the trends of this new type of media for many reasons including identity theft, social engineering, advertising and digital preservation. Some data sets have been made available to the public such as the tweets from Twitter, alternately data can be scraped from the open web. However, to ascertain trends from a group of individuals such as employees of a business, or students of a university, there is no way, without asking each individual member, what social media sites they use. Within this paper, we present a detailed approach to gaining this type of information. Specifically, for a group of geographically and organizationally affiliated members, we present an unsupervised approach that can discover and disambiguate social media profiles with a precision of 0.863 and an F-measure of 0.654.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Knowledge Management Approach to Discovering Influential Users in Social Media

A key step for success of marketer is to discover influential users who diffuse information and their followers have interest to this information and increase to diffuse information on social media. They can reduce the cost of advertising, increase sales and maximize diffusion of information.  A key problem is how to precisely identify the most influential users on social networks. In this pape...

متن کامل

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

متن کامل

CICBUAPnlp at SemEval-2016 Task 4-A: Discovering Twitter Polarity using Enhanced Embeddings

This paper presents our approach for SemEval 2016 task 4: Sentiment Analysis in Twitter. We participated in Subtask A: Message Polarity Classification. The aim is to classify Twitter messages into positive, neutral, and negative polarity. We used a lexical resource for pre-processing of social media data and train a neural network model for feature representation. Our resource includes dictiona...

متن کامل

Detecting Social Roles in Twitter

For social media analysts or social scientists interested in better understanding an audience or demographic cohort, being able to group social media content by demographic characteristics is a useful mechanism to organise data. Social roles are one particular demographic characteristic, which includes work, recreational, community and familial roles. In our work, we look at the task of detecti...

متن کامل

Understanding and Discovering Deliberate Self-harm Content in Social Media

Studies suggest that self-harm users found it easier to discuss self-harm-related thoughts and behaviors using social media than in the physical world. Given the enormous and increasing volume of social media data, on-line self-harm content is likely to be buried rapidly by other normal content. To enable voices of self-harm users to be heard, it is important to distinguish self-harm content fr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011